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DECIDING TERMINATION FOR ANCESTOR 
MATCH-BOUNDED STRING REWRITING SYSTEMS 


Alfons Geser^, Dieter Hofbauerq and Johannes Waldmanm 


ABSTRACT 

Termination of a string rewriting system can be characterized by termination on suit- 
able recursively defined languages. This kind of termination criteria has been criticized 
for its lack of automation. In an earlier paper we have shown how to construct an au- 
tomated termination criterion if the recursion is aligned with the rewrite relation. We 
have demonstrated the technique with Dershowitz’s forward closure criterion. In this 
paper we show that a different approach is suitable when the recursion is aligned with 
the inverse of the rewrite relation. We apply this idea to Kurth’s ancestor graphs 
and obtain ancestor match-bounded string rewriting systems. Termination is shown 
to be decidable for this class. The resulting method improves upon those based on 
match-boundedness or inverse match-boundedness. 

1 INTRODUCTION 

Automated termination criteria are vital for the processing, by machine, of string rewriting 
systems or similar formalisms. The limitations of the existing automated criteria show up 
clearly by their inability to prove, or disprove, termination of certain small string rewriting 
systems like Zantema’s System {aabb — > bbbaaa}. 

There are strong termination criteria which are not automated. Some of them character- 
ize termination through termination on a suitable recursively defined subset of the set of all 
strings. For instance, Dershowitz [3] shows that termination is equivalent to termination on 
the set of right hand sides of forward closures. Kurth [11] shows that termination is equiva- 
lent to the absence of infinite paths in the ancestor graph. Usually the constructed subset is 
infinite, so the construction cannot be automated directly. In order to enable automation, 
we seek circumstances under which these infinite sets become regular languages, as for the 
class of regular languages many useful properties are decidable. 

A string rewriting system R is called deleting , if for each letter that occurs in the right 
hand side of a rule, there is a greater letter in the corresponding left hand side modulo some 
partial order on the underlying alphabet. For a deleting system R and a regular language 
L, the set R*(L) of descendants of L modulo R is effectively regular [10]. One can go a step 
further by annotating each letter by a natural number that records its “recycling” history. 

If this number is bounded for all derivations starting from strings in L annotated with zeros, 
we say that the system is match-bounded for L. The annotated system of a match-bounded 
system is deleting; so match-bounded systems also preserve regularity. 
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We have shown in a previous paper [7] that RFC-match-boundedness, which is essentially 
the same as match-boundedness for the set of right hand sides of forward closures, is an 
automated criterion for termination. In the present paper we show that a different approach 
is capable to turn Kurth’s ancestor graph criterion into an automated criterion. 

The basic difference between criteria like Dershowitz’s and Kurth’s is the direction of 
recursion. Right hand sides of forward closures are aligned with the rewrite relation: the 
more complex ones are essentially descendants of the simpler ones. In contrast, the nodes of 
ancestor graphs are aligned with the inverse of the rewrite relation: the more complex ones 
are essentially ancestors of the simpler ones. 

It seems appropriate to use inverse match-boundedness for the task, a property that we 
introduced as a (non-)tcrmination criterion in [8] for the special case L = E*. It turns out 
that the main formal language theoretic result presented there is not general enough for our 
present purpose. We show how it has to be extended so as to serve as a basis for automating 
ancestor-graph like criteria. 

The paper is organized as follows. Section 3 and Section 4 contain a summary of defini- 
tions and basic results on deleting and match-bounded rewriting systems. In Section 5 we 
develop the formal language theoretic ingredients of onr method. Kurth’s ancestor graphs 
are defined in Section 6, and in Section 7 we describe how they can be characterized through 
inverse rewriting relations, leading to onr main result. Finally, we present a few examples 
and refer to an implementation of onr algorithm. 

2 PRELIMINARIES 

A string rewriting system over an alphabet E is a relation R C E* x E* that induces the 
rewrite relation ~^r = {(x£y,xry) \ x,y G E*, (£, r) G f?} on E*. Unless indicated otherwise, 
all rewriting systems are finite. Pairs (£, r ) from R are frequently referred to as rules £ — > r, 
and by lhs(-R) and rhs(i?) we denote the sets of left (right, resp.) hand sides of R. The 
reflexive and transitive closure of is ~^* R , often abbreviated as R*, and or R + 
denotes the transitive closure. An R-derivation is a (finite or infinite) sequence x 0 ,xi, . . . 
with X{ — > R Xi + 1 for all i. We call R terminating on L C E* if there is no infinite 17- 
derivation starting from a string xq G L. We say that R is terminating if R is terminating 
on E*, otherwise R is non-terminating. A particular kind of non-termination is caused by 
loops, i.e., derivations of the form x — uxv for strings x,u,v. For surveys on termination 
of rewriting we refer to [5, 15]. Further standard notations for strings and string rewriting 
can be found, for instance, in [2], 

For a relation p C A x B let p(a ) = {b G B | (a, b) G p] for a G A and p(A') = 1J aeA , p(a) 
for A' C A. The inverse of p is p~ = {(6, a) (a,6) G/)} CBx A, and we say that p satisfies 
the property inverse P if p~ satisfies P. Define Inf (p) = {a G A | p(a) is infinite}. The 
relation p is finitely branching if Inf(p) = 0. For the case A — B let Im(p) denote the set 
of all “immortal” elements, i.e., those a G A that initiate an infinite sequence a = cio, aq, . . . 
with ( ai,ai + \ ) G p for all i. 

For a relation p C E* x E* on strings and a set A C E let p|a = pH (A* x A*). Note the 
difference between R*\a and (R|a)* f° r a string rewriting system R. For R — {a — > b, b — > c} 
over E = {a,b,c} and A = (a, c}, e.g., we have (a, c) G R*\a, bat (a, c) ^ (-R|a)*- 

The set of descendants of a language ICE* modulo some string rewriting system R is 
R*(L). The system R is said to preserve regularity (context-freeness) if R*(L ) is a regular 
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(context-free) language whenever L is. For standard results on rational transductions we 
refer to [1]. 

A rewriting rule t — > r is context-free if \i\ < 1, and a rewriting system is context-free if 
all its rules are. Throughout we use e for the empty string and |a;| for the length of a string 
x. 

A relation s C E* x T* is a substitution if s(e) = {e} and s(xy ) = s(x)s(y) for x,y G E*, 
therefore s is uniquely determined by the languages s(a) for a G E. For a family of languages 
£ over T, the substitution s is an £- substitution if s(a) G £ for all a G E. For instance, if £ is 
the family of finite (context-free) languages, then s is a finite ( context-free , resp.) substitution. 
If e (f s(a) for every a G E, then s is epsilon-free. Finite substitutions are finitely branching, 
and the same holds for the inverses of finite and epsilon-free substitutions. 

3 DELETING STRING REWRITING SYSTEMS 

Here we recall definitions and results on deleting string rewriting systems [10], a topic that 
can be traced back to Hibbard [9]. For detailed proofs and algorithms we refer to [10]. 

Definition 1. A string rewriting system R over an alphabet E is > -deleting for an irreflexive 
partial ordering > on E (a precedence ) if e ^ lhs(-R), and if for each rule t —> r in R and for 
each letter a in r, there is some letter b in £ with b > a. The system R is deleting if it is 
>-deleting for some precedence >. 

Proposition 1 ([10]). Every deleting string rewriting system is terminating, and has linear 
derivational complexity. 

This class of string rewriting systems enjoys a strong effective decomposition property. 
As an immediate consequence, deleting systems preserve regularity of languages, and inverse 
deleting systems preserve context-freeness. 

Theorem 1 ([10]). Let R be a deleting string rewriting system over E. Then there are an 
extended alphabet T D E, a finite substitution sC £* xT*, and a context-free string rewriting 
system C over T such that R* = (s o C _ *)|s- 

Corollary 1 ([10]). Every deleting string rewriting system effectively preserves regularity. 

Corollary 2 ([9, 10]). Every inverse deleting string rewriting system effectively preserves 
context-free languages. 

In the present paper, we will need a more specialized version of the above decomposition 
theorem. 

Corollary 3. Let R be a deleting string rewriting system over E such that e f rhs(R). 
Then there effectively are an extended alphabet T D E, an epsilon-free finite substitution 
s C £* x T*, and an epsilon-free context-free substitution c C £* x T* such that 


R* = s o c . 


Note that this implies R * = R* — (s o c ) = c o s . 
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4 MATCH-BOUNDED STRING REWRITING SYSTEMS 


Many properties of deleting string rewriting systems carry over to the more general case of 
match-bounded systems. In this section, we summarize essential results from [7]. 

In order to record information about the history of letters during a derivation, we annotate 
letters by natural numbers, called match heights. The match heights in a reduct will be h + 1 
if the minimal height in the corresponding redex is h. Formally, for a given alphabet E define 
the morphisms lift c : E* — > (E x N)* for c G N by lift c : oh» (o,c), base : (E x N)* — > E* by 
base : (a, c) i— >• a, and height : (E x N)* — » N* by height : (a, c) i— > c. For a string rewriting 
system R over E with e ^ lhs(R) define the rewriting system 

match(R) = {£' — > lift c (r) | {£ — > r) G R , base(£') = £, 

c = 1 + min(height(/))} 

over alphabet E x N. For instance, the system match ({ah — > be}) contains the rules ao&o — > 
biCi, ao&i — »• biCi, o i bo — > biC\, a\b\ — > & 2 C 2 , CI 0&2 —> b\C \ , . . . , writing x c as abbreviation for 
(x,c). For non-empty R , the system match (R) is always infinite. 

Every mat ch(R)- derivation corresponds to an if- derivation (i.e., for x, y G (Ex N)*, if 
x — ^ match (H) V then base(a:) — base(?/)) and vice versa (i.e., for G E* and iG(Ex N)*, 
if v w and base(x) = v, then there is y G (Ex N)* such that base(y) = w and 
x — Wtch (R) y)- In particular, 


R* = lifto o match (/?)* o base . 

Definition 2. A string rewriting system R over E is called match-bounded for L C E* by 
c G N if e ^ lhs(.R) and max(height(x)) < c for every x G match(i?)*(lifto(T)). If we omit L, 
then it is understood that L = E*. 

The number max ( height (x)) in Definition 2 (and min(height(f 7 )) in the definition of 
match ( A) ) denotes the maximum (minimum, resp.) over the corresponding sequences of 
heights; we set max(e) = 0, and we leave rnin(e) undefined as this case is excluded in 
the definition of match (A). Obviously, a system that is match-bounded for L is also match- 
bounded for any subset of L by the same bound. Further, by the remark before Definition 2, if 
R is match-bounded for L then R is match-bounded for R*(L), again by the same bound. 

For a match-bounded system R , the infinite system match (i?) may be replaced by a 
finite restriction. Denote by match c (i?) the restriction of match (/?) to the alphabet E x 
{0, 1, . . . , c}. Then, if R is match-bounded for L by c, 

R* fl (L x E*) = (lift 0 0 match c (i?)* o base) D (Lx E*). 

Lemma 1. For all c G N, the string rewriting system match c (R) is deleting. 

Dually every deleting system is match-bounded. This connection with deleting rewriting 
makes results from Section 3 applicable. By Proposition 1, match-bounded systems terminate 
and have linearly bounded derivation lengths, and Corollary 1 implies that match-bounded 
systems effectively preserve regularity; as a consequence, match-boundedness by a given 
bound is decidable. 
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Corollary 4. If R is match-bounded for a regular language L, then R*(L) is effectively 
regular. 

Finally note that if R is inverse match-bounded, then — ^ is irreflexive, therefore Im (— > R ) 
and Inf(i?*) coincide. 

5 RESTRICTED INVERSE MATCH-BOUNDEDNESS 

In this section, we generalize our termination criterion [6, 8] from inverse match-bounded 
systems (for E*) to systems that are inverse match-bounded for a suitable language L. 

Consider the set Inf(i?* D (E* x R~*(L ))), consisting of all strings ifE* where infinitely 
many strings y G E* exist such that, for some z G L, x — ^ y —** R z. We start by showing 
that this set is regular for inverse deleting R and regular L. As we show later, emptiness of 
this set characterizes termination of R on R~*(L). The result can be carried over to systems 
R that are inverse match-bounded for L. 

Lemma 2 ([8]). Let E, T, A be alphabets, let c C E* x T* be a substitution, and let 
T C T* x A* be a finitely branching rational transduction such that also T~ is finitely 
branching. Then Inf(coT) is regular; and if c is a context-free substitution then effectively 
so. 

Lemma 3. For any inverse deleting string rewriting system R with e ^ lhs ( R) and any 
regular language L, both over E, the set Inf(R* n (E* x R~*(L ))) is effectively regular. 

Note that R* n (E* x L!) = R* n (. L ' x L') = ( — > R n (E* x L'))* for V = R~*(L). 

Proof. We have R* = c o s~ by Corollary 3, where c C E* x T* is a context-free substitution 
and s C E* x T* is a finite epsilon-free substitution. Thus R* fl (E* x R~*{L )) = cos' - , 
where s' — s Cl (R~*(L) xF). Both s' and s'~ are finitely branching since both s and s~ 
have this property. As R~*(L) is regular by Corollary 1, the claim follows by Lemma 2. □ 

Lemma 4. Let R over E be inverse match-bounded for L C E* by c, let S = match c (i? _ )“. 
Then to every infinite derivation xq — x\ — ■ ■ ■ such that x t G R~*(L) for alii > 0 there 
is an infinite derivation x' 0 x\ — > s ■ ■ ■ such that x\ G S' - * (lift 0 (A)) and base(x') = x t for 
all i > 0. Therefore, lni(R* n (S* x R~*(L ))) = base(Inf(,S* n ((S x N)* x ^-*(lift 0 (L))))). 

Proof. For every finite segment x t x i+ i ->r--- —■ >r Xj, 0 < i < j, of the given derivation 
and every witnessing derivation Xj ^* R y G L for Xj G R~*(L), we construct (by the 
remark before Definition 2) a derivation x” —>5 x" +1 — >5 • • • ^5 x" —>* s lift 0 (2/) such that 
base(x^) = Xk for i < k < j. Let the set A VJ comprise all strings x'( that can be obtained 
this way. 

Every set A t] is a subset of base -1 ^) fl height _1 ({0, . . . , c}*) by inverse match-bounded- 
ness, thus finite, and non-empty. We claim that A^ D A tJ i for all i<3< f- For, every 
derivation x" -^-* s x", —>* s lift 0 (y) such that y G L, which witnesses that x” G A VJ i , can 
also be read as a derivation x'[ — >* s x” ^* s lift 0 (y), whence it witnesses that x'( G A VJ . Let 
Ai = p| ; ->j Ay, which is therefore a non-empty set. Let us call v a successor of u if u —>5 v. 
By definition, every u G A VJ has a successor v G A i+ ij. We now show that every u G Ai also 
has a successor v G A i+1 . There are only finitely many successors of u since ->5- is finitely 
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branching. By the pigeonhole principle one of them, say v, is in infinitely many sets A+ij, 
j > i. In fact v is then in all of them, hence v G Aj +1 . Now we can finish the proof by 
choosing an arbitrary x' 0 G A 0 , and using the just proven existence of a successor x' i+l to 
every x\. This yields the wanted infinite derivation. □ 

Theorem 2. Let the string rewriting system R over E with e ^ lhs(i?) be inverse match- 
bounded for the regular language L C E*. Then Inf(i?* n (E* x R~*(L ))) is effectively regular. 

Proof. Let R be inverse match-bounded for L by c. The system S = match c (i? - ) - is inverse 
deleting by Lemma 1. Therefore Inf(S'* ft ((E x N)* x 5' _ *(lift 0 (L)))) is effectively regular by 
Lemma 3.~By Lemma 4, Inf(iCn(E* xR~*(L))) = base(Inf(S'*n((ExN)* x5 , -*(lift 0 (L))))), 
so Inf(R* fi (E* x R~*(L))) is effectively regular. □ 

We obtain our previous result as the special case L = E*. Note that e G lhs(i?) and 
e ^ rhs(-R) implies Inf(iC) = E*. 

Corollary 5 ([6, 8]). For an inverse match-bounded string rewriting system R, the set 
Inf (/?*) is effectively regular. 

6 ANCESTOR GRAPHS AND TERMINATION PROOFS 

Knrth introduced in his thesis a criterion that characterizes termination by the well-fonnded- 
ness of a recursively defined relation on strings. In this section we briefly review this ancestor 
graph criterion. 

Definition 3 ([11]). For a string rewriting system R over alphabet E we define the ancestor 
graph (' Vr,E r ), where the set of vertices Vr C E* and the set of edges E R C E* x E* are 
the least sets such that lhs(R) C Vr and, for u,v G E*, 

• if v G V R and u — v, then u G Vr and (u, v) G Er ( reduction ), 

• if rye G Vr and (£ — > r\rf) G R for strings ri, r -2 G E + , then iv G Vr and (£v, r 2 v) G Er 
(left extension ), 

• if ot'i G Vr and (£ — > r\rf) G R for strings ri, r -2 G E + , then v£ G Vr and (v£, vri) G Er 
( right extension), 

• if r 2 G Vr and {£ — > r^rf) G R for strings ri,r 2 ,r 3 G E + , then {£, r 2 ) G Er ( left-right 
extension). 

Remark 1. Observe that e G Vr is equivalent to e G lhs(R). In this case R is trivially 
non-terminating, so we can exlnde this case without loss of generality. 

Remark 2. Kurth’s definition of ancestor graphs is restricted to one-rule systems, and does 
not include left-right extension. In the one- rule case {£ — > r}, left-right extension is needed 
only in the trivial case where £ is a factor of r, and can therefore be suppressed. For many-rule 
systems, the left-right extension cannot be avoided. 

Theorem 3 ([11, Satz 4.24]). A string rewriting system R is terminating if and only if 
both e ^ lhs(-R) and from every node in Vr the length of directed paths in Er to nodes in 
lhs(.R) is bounded. 
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By a Konig’s Lemma argument for the finitely branching relation Er we get: 

Corollary 6. A string rewriting system R is terminating if and only if e f lhs(i?) and 
Im (Er) = 0. 

Remark 3. Knrth stated and proved Theorem 3 for one-rule string rewriting and claimed 
that it can be extended to the case of many rules. For a rendering of Kurth’s proof in the 
many-rule case see Appendix A. 

Example 1. The non-terminating system R = {ab — > bbaa } has the following ancestor graph, 
cf Example 3. It contains a cycle, therefore R is non-terminating by Corollary 6. 


ab 



abb < A aab 



abbb < abab > aaab 



abbbb < abbab abaab > aaaab 

-1 U Jf f. V f . V K 


Example 2 ([11]). The system R = {baab — > aababa} has the ancestor graph ( Vr,E r ) with 
V n = ba(ab) + U b 2 a 2 b 2 (ab)* U b + ba 2 b and 

Er = {(b l+ 2 a 2 b,b l+ 1 a 2 b), (ba(ab) l+ 2 ,ba(ab) l+1 \ 

(b 2 a 2 b 2 (ab) l ,ba(aby +3 ) \ i > 0} U {b 2 a 2 b, ba(ab) 2 }, 

illustrated in the following figure. It shows termination of R by Corollary 6. 



baab 



T 




baobab < 

bbaab 


T 

T 

bbaabb 

> baababab 

bbbaab 


T 

. 1 

bbaabbab — 

— > baabababab 

bbbbaab 


T 

T 

bbaabbabab — 

— > baababababab 

bbbbbaab 


7 ANCESTOR MATCH-BOUNDED SYSTEMS 

In this section we derive a new automated criterion for (non-)termination. We do it by 
instantiating Theorem 2 with L essentially being the vertex set of Kurth’s ancestor graph. 

We obtain Vr by inverse rewriting w.r.t. a rewriting system Rq constructed from R , using 
markers “(” and “)” for the left and right end of strings respectively. In the following, let 
E() denote E U {(, )} for alphabets E with (, ) ^ E. 
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Definition 4. For a string rewriting system R over alphabet E with (, ) ^ E define the 
system 


Rq = R U {£ — > (r 2 | (T — > rir 2 ) G -R,ri,r 2 G E + } 

U {£ — > ri) | (£ — > ri-r 2 ) G R,r l ,r 2 G E + } 

U -> (r 2 ) | (£ -*• r i'r 2 r 3 ) G R,r 1 ,r 2 ,r 3 G E+} 

over alphabet E^. Let denote (Rq)~ ■ 

The relation Er and the rewrite relation induced by Rq correspond to each other as we 
elaborate next. Let h : E*^ — > E* be the morphism that deletes the end markers “(” and 
and keeps all other letters. 

Lemma 5. Let e ^ rhs(i?). For x G E*^ and y G (*-V R -)*, if x — > R() y then x G (*-V R -)* and 
(■ h(x),h(y )) G E r . 

Proof. Case analysis on the definition of Rq. The case x —> R y is trivial. If x = uiv and 
y = u(r 2 v for some u,v G E*^ and (£, r-yr-f) G R then u G (* and v G v')* for some v' G E*. 
From h(y) = r 2 v' G Vr we get h(x) = iv' G Vr and (£v', r 2 v') G E n by left extension. The 

case where x = uiv and y = urf)v for some u,v G E*^ and (i,rir 2 ) G R is symmetric by 

right extension. This leaves the case where x = uiv and y = u(r 2 )v for some u,v G E*^ and 
(£, rir 2 r 3 ) G R. We get u G (* and v G )*. From h(y) = r 2 G Vr we get h(x) — £ G Vr and 
(£, r 2 ) G Er by left-right extension. □ 

Lemma 6. For x G (*-E + -)* and y' G E + ? if (h(x),y') G Er, then there is y G ( *-y '■)* such 
that x ~^ R<) y. 

Proof. Case analysis on the definition of the ancestor graph. Let x G u ■ h(x) ■ v for some 
u G (* and v G )*. In each case we have h(x),y' G V R . We get x — > R{) y and y G ( *-y '■)* by 
defining 

• y = uy'v in the reduction case h(x) —> R yf 

• y = uy')v in the left extension case h(x) = iv, y' = r 2 v, 

• y = u(y'v in the right extension case h{x) = iv, y' = vr i, 

• y = u(y')v in the left-right extension case h(x) = £, y' = r 2 . □ 

Lemma 7. If e f rhs (R), then (*-V R -)* = Rf*((* -lhs(R)-)*) . 

Proof. For “D”, we note that lhs(i?) C Vr and that (*-V R -)* is closed under rewriting modulo 
Rq. For “C” , we prove that x G Rq*({*- lhs(i?)-)*) for all x G Vr by induction on the recursive 
definition of the ancestor graph. □ 

Lemma 8. If e (f rhs(i?) , then 


MEr) = M^ R{) n (S^ X R-*{{*. \hs(R )-)* ))) n el 



Proof. We show (*-Im (E R )-)* = Im(^ R() n (E^ x {*-V R -)*)). 

To show “C”, let x' 0 ,x' 1 ,... satisfy (xfx' i+ f) G E R for all i > 0. By definition of the 
ancestor graph we get x\ G V R . Starting from xq = x' 0 one can construct Xq, X\, . . . iteratively 
by Lemma 5 such that Xi G (*-x’ i -)* and x t —■ > R Xi + 1 for all i > 0. 

To show “D”, let x 0 ,Xi,... satisfy x i+ \ G (*-V R -)* and Xi ~^ R{) %i+i for all i > 0. By 
Lemma 5 then ( h(xi ), h(x i+ 1)) G E R . □ 

Remark 4. As Example 1 shows, E R may admit cycles even though the rewrite relation 
induced by Rq does not. Hence usually Inf(EJ^) = Inf ( Rf, D (E*^ x (*-V R -)*)) fl E* fails to 
hold. 

Definition 5. A string rewriting system R is called ancestor match-bounded if Rq is inverse 
match-bounded for (*• lhs(i?)-)*. 

Theorem 4. Uniform termination is decidable for ancestor match-bounded string rewriting 
systems. 

Proof. Let Rq be match-bounded for L = (*• lhs(i?)-)*. Match-boundedness implies e ^ 
lhs(-R^), thus e f rhs(i?^). And by Remark 1 we may assume e f lhs(R), which is equivalent 
to e ^ lhs(i?o). 

By Corollary 6, the system R is terminating if and only if lm(E R ) = 0. We know 
lm(E R ) = Im(p) fl E* from Lemma 8 for the relation 

p — n (E* } x Rq*(L)) 

on E*y Since Rq is terminating on L, the relation Rq + restricted to Rq*(L) is irreflexive, so 
the same holds for Rq; as Rq is finitely branching, we obtain Im(p) = Inf(p*). By Theorem 2, 
Inf(p*) is effectively regular; note that p* = R*q fl (E(( x Rq*(L)). Therefore emptiness of 
Im(E fi ) is decidable. □ 

Example 3. Here we consider a few one-rule string rewriting systems of the form {a m b n — > 
b p a g }, see [14, 12]. The systems {ab — > ba} and {ab — > bba} are not ancestor match-bounded 
since they are not inverse match-bounded for V R = a{a + b)*b. In contrast, the systems 
R = {ab — > b n a m } for m, n > 2 are ancestor match-bounded by 2, for we have in each case 
match 3 (R^)*(hft 0 ((*-V^-)*)) = {* 0 -L-}* where 

L = aob 0 + (cio + afjafbi + a\bf(bo + b \ ) + di6 3 (d2 + &i)(cq + 62)0*61. 

Dropping the annotations yields the node set V R = ab*a*b + ab*aba*b. We have Im(EJ^) = 
ab*(a + b + ab)a*b = V R \ {a6}. None of these systems is inverse match-bounded. 

For string rewriting systems R over E define > R = {( x,y ) | 3s, t G E* : x —> R syt}. 
Then we get: 

Lemma 9. Let e f rhs(R), x G E*, y G E + , u G (*, and v G )*. If x uyv then x >* R y. 

Proof. By induction on the length k of the given derivation. The case k — 0 is trivial, so 
let k > 0 and x — u'y'v' —> R{) uyv for some u' G (*, v' G )*, y' G E + . By induction 

hypothesis, x y' ■ We get 1 / ^ R syt and so 1 / > R y by dehning 
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• s — e, t — e if the -R()-rule is from R , 

• s — ri, t — e if the ify-rulc t — > (r 2 stems from the R-rule t — > r\r 2y 

• s — e, t — r 2 if the hy-rule £ — > ri) stems from the i?-rule i — > r 1 r 2 , 

• s — r\ y t — r'3 if the R^-rule £ — » (r 2 ) stems from the i?-rulc £ — > r i r 2 r'3 . □ 

Theorem 5. Every inverse match-bounded, system is ancestor match-bounded by at most the 
same bound. 

Proof. Let S = match(i? - ) - . Note that S q = match(i?^) - where we assume for S q the 
angular brackets (o and )o- We prove that x >* s y for all x G (E x N)*, y G (E x {0}) + , 
u G (o, and v € )o* such that x ~^*s {) U U V ■ For this purpose we use Lemma 9 for the system 
S, the alphabet ExN, and the angular brackets (o and )o- □ 

Example 4. The system {babbab — * abbabbba} admits a loop of length 3. It is inverse match- 
bounded by 3 and ancestor match-bounded by 2. 

The converse is false as the following examples show. 

Example 5. The system {aaab — > aababaa} admits a loop of length 4. It is not inverse 
match-bounded but ancestor match-bounded by 3. 

Example 6. The system {baba — > ababbbba} admits a loop of length 4. It is ancestor match- 
bounded by 2 but not inverse match-bounded. 

Example 7. The system {babaa — > aaababbab} admits a loop of length 4. It is ancestor 
match-bounded by 2 but not inverse match-bounded. 

Example 8. The system {baaa — > aaabba} admits a loop of length 8 but no shorter loop [4], 
It is ancestor match-bounded by 3 but not inverse match-bounded. 

All one-rule, terminating, ancestor match-bounded string rewriting systems that we have 
encountered are also inverse match-bounded and RFC-match-bounded. However, the bounds 
may differ. 

Example 9. The system {baababa — > aababbaab} is terminating. It is ancestor match- 
bounded by 2; inverse match-bounded by 3 but not by 2; and RFC-match-bounded by 3 
but not by 2. 

The following problems are still open: 

Problem 1. Are all terminating, ancestor match-bounded string rewriting systems also inverse 
match-bounded and RFC-match-bounded? 

Problem 2. Are all non-terminating, ancestor match-bounded string rewriting systems loop- 
ing? 
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8 CONCLUSION 

Kurth’s ancestor graph criterion characterizes termination by termination on a specific re- 
cursively defined language. Such termination criteria have been criticized for their lack of 
automation. We add automation to this non-automated criterion. In doing so we introduce a 
new general technique that allows to lift similar characterizations to automated termination 
criteria. The new technique is amenable if the recursion is aligned with the inverse rewrite 
relation, like in Kurth’s ancestor graph criterion. We have done a similar construction ear- 
lier, where the recursion had to be aligned with the rewrite relation. That construction turns 
Dershowitz’s characterization of termination on right hand sides of forward closures into the 
RFC-match-boundedness criterion. 

The new ancestor match-bound criterion complements three related termination criteria: 

• match-boundedness for E* [7], 

• inverse match-boundedness for E* [8], 

• RFC-match-boundedness [7]. 

All four criteria are implemented in Matchbox [13], see 

http: //141 .57.11. 163/matchbox/. 

The four criteria have in common that they are strong: each applies to string rewriting 
systems, like Zantema’s System, that could not be proven (non-)terminating by automated 
methods earlier. Each is decidable for a given bound and, where applicable, allows to decide 
uniform termination. The match-boundedness criteria entail termination whereas the in- 
verse ones also cover some non-terminating systems. This may justify the somewhat greater 
effort for their correctness proofs. The criteria for E* have decidable termination and nor- 
malization problems, and they entail linear upper bounds on the derivational complexity in 
the terminating case. The restricted criteria (RFC-match-boundedness and ancestor match- 
boundedness) on the other hand apply to a greater class of systems. 

The ancestor match-bound criterion improves upon the inverse match-bound criterion. 
It covers more non-terminating systems, and obviously the computation for ancestor match- 
bounds is cheaper. Every terminating ancestor match-bounded system we found appears 
to be RFC-match-bounded, while the converse is not true. We trace this weakness back to 
the presence of both the left-extension and the right-extension branch in the definition of 
ancestor graphs. The definition of forward closures has only one such branch. We do not 
take the weakness of the ancestor match-bound as a weakness of the underlying technique. 
It is conceivable to design restrictions of ancestor graphs that characterize termination for 
certain classes of string rewriting systems. 

REFERENCES 

[1] J. Berstel. Transductions and Context-Free Languages. Teubner, Stuttgart, 1979. 

[2] R. V. Book and F. Otto. String -Rewriting Systems. Texts Monogr. Comput. Sci.. 
Springer- Verlag, New York, 1993. 


11 



[3] N. Dershowitz. Termination of linear rewriting systems. In S. Even and 0. Kariv (Eds.), 
Proc. 8th Int. Coll. Automata, Languages and Programming ICALP-81 , Lecture Notes 
in Comput. Sci. Vol. 115, pp. 448-458. Springer- Verlag, 1981. 

[4] A. Geser. Loops of superexponential lengths in one-rule string rewriting. In S. Tison 
(Ed.), Proc. 13th Int. Conf. Rewriting Techniques and Applications RTA-02, Lecture 
Notes in Comput. Sci. Vol. 2378, pp. 267-280. Springer- Verlag, 2002. 

[5] N. Dershowitz. Termination of rewriting. J. Symbolic Comput ., 3(1 — 2) :69 — 115, 1987. 

[6] A. Geser, D. Hofbauer and J. Waldmann. Match-bounded string rewriting systems. 
In B. Rovan and P. Vojtas (Eds.), Proc. 28th Int. Symp. Mathematical Foundations 
of Computer Science MFCS-03, Lecture Notes in Comput. Sci. Vol. 2747, pp. 449-459. 
Springer- Verlag, 2003. 

[7] A. Geser, D. Hofbauer and J. Waldmann. Match-bounded string rewriting systems. 
Appl. Algebra Engrg. Comm. Comput. 15(3-4): 149-171, 2004. 

[8] A. Geser, D. Hofbauer and J. Waldmann. Termination proofs for string rewriting 
systems via inverse match-bounds. NIA Report No. 2004-02, National Institute of 
Aerospace, Hampton, VA, LISA, 2004. 

[9] T. N. Hibbard. Context-limited grammars. J. ACM, 21(3):446-453, 1974. 

[10] D. Hofbauer and J. Waldmann. Deleting string rewriting systems preserve regularity. 
Theoret. Comput. Sci., 327 (3) :301 317, 2004. 

[11] W. Kurth. Termination und Konfluenz von Scmi-Thue-Systemen mit nur ciner Regel. 
Dissertation, Tcchnische Universitat Clausthal, Germany, 1990. 

[12] G. Senizergues. On the termination problem for one-rule semi-Thue systems. In 
H. Ganzinger (Ech), Proc. 7th Int. Conf. Rewriting Techniques and Applications RTA- 
96, Lecture Notes in Comput. Sci. Vol. 1103, pp. 302-316. Springer- Verlag, 1996. 

[13] J. Waldmann. Matchbox: a tool for match-bounded string rewriting, In V. van Oostrom 
(Ed.), Proc. 15th Int. Conf. Rewriting Techniques and Applications RTA-Of, Lecture 
Notes in Comp. Sci. Vol. 3091, pp. 85-94. Springer- Verlag, 2004. 

[14] H. Zantema and A. Geser. A complete characterization of termination of h p l q — > l r 0 s . 
Appl. Algebra Engrg. Comm. Comput., 11(1): 1—25, 2000. 

[15] H. Zantema. Termination. In Terese, Term Rewriting Systems, pp. 181-259, Cambridge 
University Press, 2003. 


12 



A KURTH’S PROOF OF THE ANCESTOR GRAPH CRITERION 

We render Kurth’s proof of Theorem 3 with small changes to account for many-rule systems 
and to improve the presentation. 

Let an infinite derivation wq — >r w\ ■■■ be given. We designate positions in this 
derivation by pairs (p, i ) where p is a position in -uy. Positions (p, i ) and (p', i + 1) are called 
residuals of each other if there are x, y G E* and (£ — > r) G R such that wy = x£y and 
w i+ 1 = xry , and either 0 < p' = p < |x| or both \x£\ < p < \x£y\ and p' — p — \£\ + |r|. Note 
that p ^ {|x|, \x£\). The residual relation is extended to an equivalence relation. For every 
j and (p, i), there is at most one residual (//, j) of (p, i). 

Definition 6 ([11, Definition 4.25]). The chain graph (V, E) of an infinite derivation 
x 0 — X\ — >r ■ ■ ■ is defined as follows. The node set V is the set of all residuals of redex 
positions of the derivation. The edge set E = E 0 U E\ is the least set such that 

• if (p',i — 1) is the residual of (p, i) G V, then (( p',i — 1), (p, i)) G E 0 ; 

• if (p', i — 1) is the redex position of the step Xi_ \ -^r x i: and (p, i) G V has no residual 
in Xi- 1 , then ((p',i - 1), (p,i)) G E v 

By definition, at each node we have a copy of the residual redex. 

Lemma 10 ([11, Hilfssatz 4.26]). The chain graph of an infinite derivation contains a 
path with infinitely many edges in Ei. 

Proof. One observes that the chain graph is a finitely branching forest of finitely many trees. 
By the pigeonhole principle, one of the trees must be infinite. By Konig’s Lemma it has 
an infinite path. If this path has only finitely many edges in Ei then there is an infinite 
ipj-path, a contradiction to the definition of V. So it has infinitely many edges in E\. □ 

We will call a path in the chain graph a chain path , and a path in the ancestor graph an 
ancestor path for short. 

Lemma 11 ([11, Hilfssatz 4.27]). Let wq -^r wi — ► r ■ ■ ■ be an infinite derivation. Then 
to every chain path that has k + 1 edges in Ei there is an ancestor path of length greater 
than, or equal to, k from a factor of w 0 to a left hand side of R. 

Proof. Let (po,0), (pi, 1), . . . , (p n + 1 , n + 1) denote the starting segment of the given chain 
path, where n > k + 1 is chosen such that ((p n ,n), (p n +i , n + 1)) is the k + 1-th edge in E x . 

We are going to show by induction on n — j that for every 0 < j < n there is an ancestor 
path from a factor of Wj that encompasses the residual redex at (ppj). The lemma then 
follows for j = 0. 

The claim holds trivially for j = n, so assume j < n. By induction hypothesis, there is 
an ancestor path from a factor w of Wj + 1 that encompasses the residual redex at (Pj+i, j + 1). 
Case 1: (p j+ ij + 1)) e E 0 . 

Case 1.1: The contractum of step Wj — >r Wj + i and the factor w neither overlap, nor is 
one a factor of the other. Then the factor w stays unchanged by the step w 3 — >r Wj + 1 , and 
we keep the ancestor path constructed so far. 
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Case 1.2: The contractum of step w 3 — w 3+ 1 and the factor w either overlap, or one is 
a factor of the other. This induces a factor w' G Vr in w 3 and an edge ( w w ) that we can 
add at the front of the ancestor path. 

Case 2: (( Pj,j ), (Pj+ 1 , j + 1)) G Ei. Then the contractum of step w 3 — >r w 3+ i and the 
factor w either overlap, or one is a factor of the other. This induces a factor w' G Vr in Wj 
and an edge ( w',w ) that we can add at the front of the ancestor path. □ 

The proof of Theorem 3 is now finished as follows: Suppose there is an infinite derivation 
w 0 W\ ^r • ■ ■ . By Lemma 10, there is a chain path that has infinitely many Ld-edges. 
So this chain path satisfies the premises of Lemma 11 for every k > 0 whence there is an 
infinite forest of ancestor paths starting from factors of wo. Since there are only finitely 
many factors in Wq, this is a forest of finitely many trees, so one of the trees must be infinite 
by the pigeonhole principle. Hence the length of ancestor paths starting from one factor of 
wq is unbounded. 
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